Likelihood Component Analysis
ثبت نشده
چکیده
Independent component analysis (ICA) is popular in many applications, including cognitive neuroscience and signal processing. Due to computational constraints, principal component analysis is used for dimension reduction prior to ICA (PCA+ICA), which could remove important information. The problem is that interesting independent components (ICs) could be mixed in several principal components that are discarded and then these ICs cannot be recovered. To address this issue, we propose likelihood component analysis (LCA), a novel methodology in which dimension reduction and latent variable estimation is achieved simultaneously by maximizing a likelihood with Gaussian and non-Gaussian components. We present a parametric LCA model using the logistic density and a semi-parametric LCA model using tilted Gaussians with cubic B-splines. We implement an algorithm scalable to datasets common in applications (e.g., hundreds of thousands of observations across hundreds of variables with dozens of latent components). In simulations, our methods recover latent components that are discarded by PCA+ICA methods. We apply our method to dependent multivariate data and demonstrate that LCA is a useful data visualization and dimension reduction tool that reveals features not apparent from PCA or PCA+ICA. We also apply our method to an experiment from the Human Connectome Project with state-of-the-art temporal and spatial resolution and identify an artifact using LCA that was missed by PCA+ICA. We present theoretical results on identifiability of the LCA 1 model and consistency of our estimator. Proofs are in a web supplement.
منابع مشابه
Constructing and Evaluating the Reliability and Validity of the Professional Identity of Teachers Questionnaire
Introduction: Teachers'professional identity is the core of their teaching activity and professional commitment, and its discovery is very important for improving curricula and students learning. The present study was conducted to design and evaluate the validity and reliability of the teachers'professional identity questionnaire. Methods: This is a sequential combined exploratory study. The ...
متن کاملIntrinsic Dimension Estimation by Maximum Likelihood in Probabilistic PCA
A central issue in dimension reduction is choosing a sensible number of dimensions to be retained. This work demonstrates the asymptotic consistency of the maximum likelihood criterion for determining the intrinsic dimension of a dataset in a isotropic version of Probabilistic Principal Component Analysis (PPCA). Numerical experiments on simulated and real datasets show that the maximum likelih...
متن کاملThe Possibility of Created the Vegetation Cover Maps in the Central Zagros Forest by Using the IRS Satellite Image
The preparation of vegetation cover maps by used the land inventory and a traditional method has a lot of cost and time. But today, remote sensing is one of the main sources of data collection and information production for study and monitoring land resources, and was efficient tools for providing quickly and timely data and information needs for program planning in the natural resource filed. ...
متن کاملLand Cover Classification Using IRS-1D Data and a Decision Tree Classifier
Land cover is one of basic data layers in geographic information system for physical planning and environmentalmonitoring. Digital image classification is generally performed to produce land cover maps from remote sensing data,particularly for large areas. In the present study the multispectral image from IRS LISS-III image along with ancillary datasuch as vegetation indices, principal componen...
متن کاملThe Possibility of Created the Vegetation Cover Maps in the Central Zagros Forest by Using the IRS Satellite Image
The preparation of vegetation cover maps by used the land inventory and a traditional method has a lot of cost and time. But today, remote sensing is one of the main sources of data collection and information production for study and monitoring land resources, and was efficient tools for providing quickly and timely data and information needs for program planning in the natural resource filed. ...
متن کاملMixtures of Principal Component Analyzers
Principal component analysis (PCA) is a ubiquitous technique for data analysis but one whose effective application is restricted by its global linear character. While global nonlinear variants of PCA have been proposed, an alternative paradigm is to capture data nonlinearity by a mixture of local PCA models. However, existing techniques are limited by the absence of a probabilistic formalism wi...
متن کامل